Session 3️⃣: Data Donation Studies (Researcher Perspective)
Frieder Rodewald (University of Mannheim) & Valerie Hase (LMU Munich)
👉 Part of the SPP DFG Project Integrating Data Donations in Survey Infrastructure
What are methodological decisions researchers have to take in data donation studies? 🤔
Figure. Data donation study - researcher perspective
Research design & tool set-up
Data cleaning & augmentation, including
📢 Task 3: Classify search terms
Modelling digital traces
Image by Hope House Press via Unsplash
Source: Image by Markus Winkler via Unsplash
Figure. Data donation study - researcher perspective
Key decisions:
Key decisions:
This may sound silly but:
Key decisions:
Choose a tool, e.g., …
Figure. Filtering data - File extraction
Figure. Filtering data - Python code
Figure. Filtering data - Python code
Figure. Anonymization - Example of Whitelists
Figure. Example of anonymized data
Figure. Aggregation - Python code
Figure. Data deletion
This is how much “fun” testing DDTs is:
Figure. Github issues - Testing the tool
Key issues 🚨 (Hase et al., 2024)
Let’s have a look at the technical set-up 💻:
Running the DDT locally
Key decisions:
Low response rates (e.g., Hase & Haim, 2024; Keusch et al., 2024)
Non-response bias
Primary used in non-probability panels (e.g. online access panels)
Survey design strategies: For now, 🤑 is the only thing that works.
👉 Again, we will talk about this in session 4️⃣.
Figure. Data donation study - researcher perspective
Figure. Data donation study - researcher perspective
This is how your data may look like:
Figure. Donated data - example
This is how your data may look like:
Figure. Donated data - example
📢 Task 3: Classify search terms
Download the data for Task 4 from the workshop website. This contains YouTube searches collected from a German social media sample. Either discuss this (no-code group) or do this in R/Python (code group)…..
How you would clean the data?
How you would identify health-related searches using NLP methods?
Figure. Donated data - example
👉 You know the drill: We will talk about this in session 4️⃣.
Figure. Data donation study - researcher perspective
Figure. Data donation study - researcher perspective
Think carefully about…
Questions? 🤔
Data Donation Studies - COMPTEXT - Frieder Rodewald, Valerie Hase